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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: INSTITUT PASTEUR 

(B) STREET: 28 RUE DU DOCTEUR ROUX 

(C) CITY: PARIS CEDEX 15 

(E) COUNTRY: FRANCE 

(F) POSTAL CODE (ZIP) : 75724 

(ii) TITLE OF INVENTION: A METHOD FOR ISOLATING A POLYNUCLEOTIDE OF 
INTEREST FROM THE GENOME OF A MYCOBACTERIUM USING A 
BAC-BASED DMA LIBRARY. APPLICATION TO THE DETECTION OF 
MYCOBACTERIA. 

(iii) NUMBER OF SEQUENCES: 5 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12732 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) T^NTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 



ACCTGCGCTT 


GCAGAGATCA 


AATAGGGCGC 


ATGGGTCAGC 


ATAGTACAGG 


TCGTCGCGCA 


60 


TCTTTGATGC 


ATCGGAATAA 


GATGTCAGGC 


AATTAAAAGA 


GAAGCCACGG 


CGACTCGCGG 


120 


CATTCAGCAT 


GTCGAGCGTC 


GCTTCGATGT 


GAGCGCACCA 


TTCCGTGTCC 


AACGATTTCA 


180 


GACGAACATT 


GAATATTCCA 


CTCGCGACGC 


TATAGTCCGC 


CTCCCGATCT 


ATGCGCGCCG 


240 


CGCAGATGAA 


GTCTGCGTTC 


GCCCGACCTT 


CGAAACGTAG 


TGCGGCCGCG 


CGCACCATTT 


300 


CGGGGGAGAC 


GTCGATGCCG 


GTGTAATCAG 


TTTTGAAGCC 


ACGCGCATCT 


AGGTAGTCCA 


360 


GTAGAGCCCC 


ATAGCCACAG 


CCTAGATCGT 


TGATCGAAAA 


TGGGTCCGCC 


GCATTGACAA 


420 


TGCGCACCAG 


CTGGTCAAAG 


CGCAACGCCT 


GCCCGGCTTC 


GCCGTTCCAA 


TCGACGCCGC 


480 


GCGGGTGCCG 


TGTGCTTCGA 


GTTTCGATGC 


GTAGTJ=ACGG 


GCCACGTCAG 


CGAGCATGGT 


540 
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CGTTGCGTCT TCCGCCATGA AGCTGCCTCA CGATTTGTGT GTGTGGGCGT CGGTGCGTGG 600 

GTCCGAGACT ATACCTTCAA CAGTTGCATG CCGAGGCTGC GGCGGGCAAT GACCCAAAAA 660 
CCCGCCGGCA CGGTTCGCCG AGCT^GGAAG CGTGGAGACG ATAGATAATT TCACTGGCGA . 720 

CAGTACCTCA AATAGTCCGG AGCCTCGGCT CCGACGTTAA AGAGCAGATC CAGAATCGAC 780 

ACGGCGGGCT CGAACCCTCC CCACAATTGC TTATAATCGC GGTAGCCGTC ATAATCGAAC 840 

CAAGTTACCC GGATGCTAAG TTCGTCGT^C ACGCGCTCAT CGACATACGA ACGGGCTGAG 900 

GGGCCAGAGA CATATTCGGT CGCTGCGGCC TGTTGGCAGA GGTTGGCCAG TCTCTCGGTC 960 

TTGCCGTCGG CTAATTCGTA GTCCCACGAA TTTGCCAGTC GCGTGCTGAT ACCGAGATAA 1020 

CTGCAAATCG CATTCAATAG ACGCCTGTTG AGTAAGGAAA GATTCGTGTG CTGTTCTTCG 1080 

AGGTAAATCG GCGCGAGCCA GTCAGCGATC TCCGCAAAAT GAGCGGCCGC GCTGTAGTTG 1140 

AATTCTAGTG CCCGCCAGTG CGCTTTCGCC CAATCGGTGC CGTCGATCAG CGTCTCACGT 1200 

ATCTTTTGAT GGAAACGTCC CTTCACCTGG ACGGGAACAG TTATCCACTG TAACCCCTGG 1260 

CTCGTTTTGA TCCGATTTCT GTTTCGCCAA TCACGCTTGG TATATTGCAT GTCATCATAG 132 0 

ATGATGAATT CATCGACGAA TGCAATCAGG TC7VAAATATC CTCGCCAAGG TATGTAATTT 1380 

GATTGAACAA TCGCGACTTT CTTCAACGCG GTGTCTCCAA TTTAGAATAA CAAATACGTC 1440 

GCGCCCGCGA CAGCTCCGCT GGAGCGAGTT CAAGCGATTC TGCGACATAT TCAATATGGT 1500 

GCTCGGGAAG GCCAGGATGG GCCGCGACCC GGGGCGTCCG GTGCGCGATG AACGTCGCAT 1560 

CGTCTCCTGT GAGATAATTG CATCCGATCA TATAGGGCTG GCTGCGGCTA GGTTGCTGGC 1620 

AAAAAGATAT CGCGGCCX5AT CCGTTTCTGG TTTTGTCTTG ATGATCAAAT CCGCTTCCGT 1680 

TCACGAGATC GATTCCTGGT CTTCCCCCAG CGTCGCGATG TCGATAGGTG TCGCGCTTTG 1740 

TTCGTACCCG CACTACGCGG CGGCGAGAAC CTCGCCACCG AATCGGGATT GGGGGGAGGA 1800 

TACCACTCGG TCGAGGCCCG TCACCGGCCT TCTAGCGGGT TGACCATCAG TGTTTGCAGG 1860 

GCCCTATCCC GGTATGGCGC ACCACGGGAT CGGCAGCGTT CCGGTTGCTG GCGTGGTACC 1920 

TCGTTGTGGC GCCGTGGTCC ATGTCGATTG AGTGCGTGGA TCAGTGTAAA CCGTTGCGCG 1980 

CCATGTTCTG TAGGCACTGG TTCGGGTTGT GGTTAGGCTG CACGGTTGGC AGGTTACCAA 2040 

CCACTGAGCC CCTGGGCGGA TGTGAGCTCG GACTCCGCCT ATGGGGTGTA ATTTTGGCAG 2100 

ATTGGGCCGG GTCCCCGTGG TGAGGACTCC TCAACCGGAT TGGGTAAGCA TGAGGTGGTG 2160 

CTGGCAGCGG TGTCCTGGTC GCTCTCCCGA GTAGGCCCGT TGTGACTGTC ATGTGGGCGA 2220 
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GCGGGTTTGC GCGCGTAGGA GACGATGATT ACTACGCACG TGACCAACCA CAAGAACGGT 
GCCCATGTCA CCGTGGTGAA AACGAGTGGC GTGGTACCGA CTACCCCTTT GGCTCCCAGC 
TGTCCATAGA GCGGCACGTA GAACGGCTGG CCCGGGACCG CGACGTTGAC GATGCTCAGC 
GCCACGGCCA AACTCACGCA GACGCCGACC GCGCGGCGGC GGTCTCCATG GGCTGCGAGT 
TGGTCGAATA TCCCAGCACC AGGAGGCCCG TTGGGGTCTC GGGCTACCAG TGCAGCGATT 
GGCAAGACGA AAACGAGATA GTAGAAGGCG ACGTCCGCGG GGGAGAAGGT GGGGGTGGCG 
AGCAACACAA TCCCCACCAT GACAGGCGGG ATACGGCGTC CGAGCGCCAG CACGGCGACC 
ACGACTATGA CTAGGACAGC AAACCCGATC TGCGTTCGCG GACCAGTGAG GAAACCCTCT 
GGGATCTTGC CCGATTGATA GTTCTTGATG CTATCGGGGA TCAGCAGGAG TGCCTTGCCA 
AAGGACACGT TCCGCGGGTC TCGAAGCCCT CCGT^CGAAC TATTGAACTT GATGATGCCG 
TGGATCGACT GTGCGATCGT CCCCGGGAAG CCTCGTGGCC ACAACAGATUV GGCTGCGATA 
TTGGACACCA CCACGCCGGT GATCCCGATA CCAGCCCACC GCCATTGTCG AGCCGCCAAC 
AACACCACGC CGAGAACX3AC GAACTGCGGC TTTACCAGGA CGGCCAAGAT CACCGTGATG 
GTGGCGAGGC CCCACCGCTG TCGGGACAAC GCCACGAAGT AAGCCAGCGC GATCG6TACC 
ACGAACCCTG TCGAGTTGCC TCGATCGATG ACCCCCCACG CCGGGATGGC CGCGGCGCCC 
AGTGTCACGA AGATGACCAC TCGCTCCAGA CCACGTGCCC CCCGGGCCGC CCAGATGGCG 
GGAGATATGA CCGCCATCGT TAGGGCGACC AGGTAACAGA TCAGCCCCAA GCGCGGCGCA 
CCCAGCCAAT GGCTGGGTAG TCCGAAAATC GCATACGGTA TGCGGGCGGG GGCCCATGCA 
GCAACCGCGG TCGGCTGGTA ATCGGCGGGT AGCGAGATCA GGTAGTCCGC GGGATTGGGT 
TGAATCCCGG CGGCGGCGAC CATGGCGTAG TCGCTGAAGC AGTGCCGACC GATATTCATG 
CCCCAATCAA GCCAACAGTC CCCAGGGACT ACCAAAAGAG TGGAAAAGAC GTCGACCGCG 
TACCACTGAC TGAGGGCGTA CGCCGTCGCC GCCGAAATCA CCGACGCCAG CAGGATGGTG 
CCGAGCATGA GGGTGCGCTC GGATTGGGAG CCGATCGCCC AGAGCCGCTC CCGGCTCGCG 
GTCACGGCAC CGCGCAACAC CTCCGGGGGT CGCTTCATCT GGATTCTCCT CGGTTCTGCG 
CGAAACGGTA GCAGAGCGCC ATGGTTGCCA ACGCGGTCGC CGGGCAGTCT AGACCGGATC 
TTCCTCGTGG CAACCGACAA CAGGACGTCG TTGCCGAAAG GGCGCTGGGC ACCGACATCT 
AGGATGAACC CACAGCCACG CCCCGACGTT ATGCCATGGC GAAGAGCGAC CGGCAGGAGC 
GGGAACCCAG TGAAGCGAGC GCTCATCACC GGAATCACAG GACCGGACGG CTCGTATCTC 
GCTAAGCTCC CGCTGAAGGG ATATGTGGCC GCTGGTAGCC CGGCCGAGGT CTATTTCTGC 
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TGGGCGACAC GGAATTATCG CGAATTGTAT GGGTTGCTCG CGGTCAACAG CATCTGGTTC 4020 

AATCACGAAT CACCGCGTCA CGGCGAGACA TTCATGACTC GTAATCCTGC ACCATATCGC 4080 

GGTCGGCAAC GAGGCGCTGA TCGATGCGCA GACGCTGATG CX3CCGGCCCA CCCGGATAGG 4140 

TATCAGTATT GGGGCGTTCC GGCCAGCX3TA CGAGGCGTGA TCX3ACCX3CGC AATGGGTGTT 4200 

TGCGTTGAGT AATAATCTGA ACCGTGTGAA CGCATGCATG GATGGATTCC TTGCCCGTAT 4260 

CCGCTCACAT GTTGATGCGC ACGCGCCAGA ATTGCGTTCA CTGTTCGATA CGATGGCGGC 4320 

CGAGGCCCGA TTTGCACGCG ACTGGCTGTC CGAGGACCTC GCXSCGGTTGC CTGTCGGTGC 4380 

AGCATTGCTG GAAGTGGGCG GGGGGGTACT TCTGCTCAGC TGTCAACTGG CGGCGGAGGG 4440' 

ATTTGACATC ACCGCCATCG AGCCGACGGG TGAAGGTTTT GGCAAGTTCA GACAGCTTGG 4500 

CGACATCGTG CTGGAATTGG CTGCAGCACG ACCCACCATC GCGCCATGCA AGGCGGAAGA 4560 

CTTTATTTCC GAGAAGCGGT TCGACTTCGC CTTCTCGCTG AATGTGATGG AGCACATCGA 4620 

CCTTCCGGAT GAGGCAGTCA GGCGGGTATC GGAAGTGCTG AAACCGGGGG CCAGTTACCA 4680 

CTTCCTGTGC CCGAATTACG TATTCCCGTA CGAACCGCAT TTCAATATCC CAACATTCTT 4740 

C7VCCAAAGAG CTGACATGCC GGGTGATGCG ACATCGCATC GAGGGCAATA CGGGCATGGA 48 00 

TGACCCGAAG GGAGTCTGGC GTTCGCTCTU^ CTGGATTACG GTTCCCAAGG TGAT^CGCTT 4860 

TGCGGCGAAG GATGCGACGC TGACCTTGCG CTTCCACCGT GCAATGTTGG TATGGATGCT 4920 

GGAACGCGCG CTGACGGATA AGGAATTCGC TGGTCGCCGG GCACAATGGA TGGTCX3CTGC 4980 

TATTCGCTCG GCGGTGAAAT TGCGTGTGCA TCATCTGGCA GGCTATGTTC CCGCTACGCT 5040 

GCAGCCCATC ATGGATGTGC GGCTAACGAA GAGGTAATGA CATGGCGCAA GCGACATCGG 5100 

GCATTCGCGC GGCACTTTCG CAACCTGCTG TGTATGAGGC GTATCAGCGG ATTGCGGGCG 5160 

CTAAAAGCGG GCTTGCGTGG ATCACAACCG ACCCCATCCA GTCGTTGCCA GGCATGCGTA 5220 

CTCTCGACCT CGGTTGCTGG CCAGCGGTGA TACACAGCTC CCCGCCAGTG GACGTGACAT 5280 

GTACGAGAGA CGGCATGAGC GCGGAATGTG CGACCGTGCC GTCGAGATGA CCGACGTCGG 5340 

CGCTACGGCA GCCCCCACCG GACCTATCGC GCGGGGCAGC GTCGCTCGGG TCGGCGCGGC 5400 

GACCGCGTTG GCCGTTGCCT GCGTCTACAC GGTCATCTAT CTGGCGGCCC GCGACCTACC 5460 

CCCGGCTTGT TTTTCGATAT TCGCGGTGTT TTGGGGGGCG CTCGGCATTG CCACCGGCGC 5520 

CACCCACGGC CTCCTGCAAG AAACGACCCG CGAGGTCCGC TGGGTGCGCT CCACCCAAAT 5580 

AGTTGCGGGC CATCGTACCC ATCCGCTGCG GGTGGCCGGG ATGATTGGCA CCX3TCGCGGC 5640 
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CGTCGTAATT GCGGGTAGCT CACCGCTGTG GAGCCGACAG CTATTCGTCG AGGGGCGCTG 
GCTGTCCGTG GGGCTACTCA GCGTTGGGGT GGCCGGGTTC TGCGCGCAGG CGACCCTGCT 
GGGCGCGCTG GCCGGCGTCG ACCGGTGGAC ACAGTACGGG TCACTGATGG TGACCGACGC 
GGTCATCCGG TTGGCGGTCG CCGCGGCAGC GGTTGTGATC GGATGGGGTC TGGCCGGGTA 
CTTGTGGGCC GCCACCGCGG GAGCGGTGGC GTGGCTGCTC ATGCTGATGG CCTCGCCCAC 
CGCGCGCAGC GCGGCCAGCC TGCTGACGCC CGGGGGAATC GCCACGTTCG TGCGCGGTGC 
CGCTCATTCG ATAACCGCCG CGGGTGCCAG CGCGATTCTG GTAATGGGTT TCCCAGTGTT 
GCTCAAAGTG ACCTCCGACC AGTTAGGGGC AAAGGGCGGA GCGGTCATCC TGGCTGTGAC 
CTTGACGCGT GCGCCGCTTC TGGTCCCACT GAGCGCGATG CAAGGCAACC TGATCGCGCA 
TTTCGTCGAC CGGCGCACCC AACGGCTTCG GGCGCTGATC GCACCGGCGC TGGTCGTCGG 
CGGCATCGGT GCGGTCGGGA TGTTGGCCGC AGGGCTTACC GGTCCCTGGT TGCTGCGTGT 
TGGATTCGGC CCCGACTACC AAACTGGCGG GGCGTTGCTG GCCTGGTTGA CGGCAGCGGC 
GGTAGCTATC GCCATGCTGA CGCTGACCGG CGCCGCCGCG GTCGCGGCCG CACTGCACCG 
GGCGTATTTG CTGGGCTGGG TCAGCGCGAC GGTGGCGTCG ACGCTGTTGC TGCTGCTGCC 
GATGCCGCTG GAGACGCGCA CCGTGATCGC GCTGTTGTTC GGTCCAACGG TGGGAATCGC 
CATCCATGTG GCCGCGTTGG CGCGGCGACC CGACTGATTT GTGCCCCAGG TCGACAAATC 
ACGCCGTCTC GTCAGTGAGC ACTCCGTCCT CGGGTCCGAT CCTTCCAGGA GACGTTGCAA 



CCTGATTTGG CTCAT^TTGG TGCGCACCGA GGGTCGGGCA CATCGTAGGG TCGCAACAGT 
CACATGTGTC ACTGCACCGG GCGACACCCG ATGTCCCGGC TCTCAGCGAC AGCTGTCTGA 
CCTGTGGTTT TGTTCCCAAG TTGGTCGTGG CTGTGCGGGA TTGGAGGTGG CGTGGGGGTC 
GCGTCGTATG GATTCTCCTC CTCGGTTCCG CGCGAAACGG CCGCAGGCGC AATGGTCACC 
AACTTGGCCG CGGTGGAGTC TAGCCTCACA TTTTCCTGGT CGCCCCCGAC AACCAGGAGG 
TCGCTGCAGA ACGGGCGTTC CCTACCCACA TCTACTATGA AGC6ACAGCG GCGCCCCGCT 
GTGATGGCTG AGCATGACCG ACAGAGGCGG GAAGACAGTG T^GCGAGCGC TCATCACCGG 
AATCACCGGC CAGGACGGCT CGTATCTCGC CGAACTGCTG CTGGCCAAGG GGTATGAGGT 
TCACGGGCTC ATCCGGCGCG CTTCGACGTT CAACACCTCG CGGATCGATC ACCTCTACGT 
CGACCCGCAC CAACCGGGCG CGCGGCTGTT TCTGCACTAT GGTGACCTGA TCGACGGAAC 
CCGGTTGGTG ACCCTGCTGA GCACCATCGA ACCCGACGAG GTGTACAACC TGGCGGCGCA 
GTCACACGTG CGGGTGAGCT TCGACGAACC CGTGCACACC GGTGACACCA CCGGCATGGG 
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ATCCATGCGA CTGCTGGAAG CCGTTCGGCT CTCTCGGGTG CACTGCCGCT TCTATCAGGC 
GTCCTCGTCG GAGATGTTCG GCGCCTCGCC GCCACCGCAG AACGAGCTGA CGCCGTTCTA 
CCCGCGGTCA CCGTATGGCG CCGCCAAGGT CTATTCGTAC TGGGCGACCC GCAATTATCG 
CGAAGCGTAC GGATTGTTCG CCGTTAACGG CATCTTGTTC AATCACGAAT CACCGCGGCG 
CGGTGAGACG TTCGTGACCC GAAAGATCAC CAGGGCCGTG GCACGCATCA AGGCCGGTAT 
CCAGTCCGAG GTCTATATGG GCAATCTGGA TGCGGTCCGC GACTGGGGGT ACGCGCCCGA 
ATACGTCGAA GGCATGTGGC GGATGCTGCA GACCGACGAG CCCGACGACT TCGTTTTGGC 
GACCGGGCGC GGTTTCACCG TGCGTGAGTT CGCGCGGGCC GCGTTCGAGC ATGCCGGTTT 
GGACTGGCAG CAGTACGTGA AATTCGACCA ACGCTATCTG CGGCCCACCG AGGTGGATTC 
GCTGATCGGC GACGCGACCA AGGCTGCCGA ATTGCTGGGC TGGAGGGCTT CGGTGCACAC 
TGACGAGTTG GCTCGGATCA TGGTCGACGC GGACATGGCG GCGCTGGAGT GCGAAGGCAA 
GCCGTGGATC GACAAGCCGA TGATCX3CCGG CCGGACATGA ACGCGCACAC CTCGGTCGGC 
CCGCTTGACC GCGCGGCCCG GGTCTACATC GCCGGGCATC GCGGCCTGGT CGGGTCCGCG 
CTGCTACGCA CGTTTGCGGG CGCGGGGTTC ACCAACCTGC TGGTGCGGTC ACGCGCCGAG 
CTTGATCTGA CGGATCGGGC CGCGACGTTC GACTTCGTTC TCGAGTCGAG GCCGCAGGTC 
GTCATCGACG CGGCGGCCCG GGTCX3GCGGC ATCCTGGCCA ACGACACCTA CCCGGCCGAT 
TTCCTGTCGG AAAACCTCCA GATCCAGGTC AACCTGCTGG ATGCCGCCGT GGCGGCGCGG 
GTGCCGCGGC TGCTGTTCCT GGGCTCGTCG TGCATCTACC CGAAACTCGC CCCGCAGCCG 
ATCCCGGAGA GCGCGCTGCT CACCGGTCCG TTGGAGCCGA CCAACGACGC GTACGCGATC 
GCCAAAATCG CCGGCATCCT TGCX3GTCCAG GCGGTGCGCC GCCAACATGG CCTGCCGTGG 
ATCTCGGCGA TGCCCACCAA CCTGTACGGG CCAGGCGACA ACTTTTCGCC GTCCGGCTCG 
CATCTGCTGC CGGCACTCAT CCGCCGCTAT GACGAGGCCA AAGCCAGTGG CGCGCCCAAC 
GTGACCAACT GGGGCACCGG CACGCCCCGA CGGGAGTTGC TGCACGTCGA CGACCTGGCG 
AGCGCATGCC TGTATCTGCT GGAACATTTC GACGGGCCGA CCCATGTCAA CGTGGGAACC 
GGCATCGACC ACACCATCGG CGAGATCGCC GAGATGGTCG CCTCGGCGGT AGGCTATAGC 
GGCGAAACCC GCTGGGATCC AAGCAAACCG GACGGAACAC CACGCAAACT GCTGGATGTT 
TCGGTGCTAC GGGAGGCGGG ATGGCGGCCT TCGATCGCGC TGCGCGACGG CATCGAGGCG 
ACGGTGGCGT GGTATCGCGA GCACGCGGGA ACGGTTCGGC AATGAGGCTG GCCCGTCGCG 



7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 



4 



WO 99/54487 



PCT/tB99/00740 



7 



CTCGGAACAT CTTGCGTCGC AACGGCATCG AGGTGTCGCG CTACTTTGCC GAACTGGACT 
GGGAACGCAA TTTCTTGCGC CAACTGCAAT CGCATCGGGT CAGTGCCGTG CTCGATGTCG 
GGGCCAATTC GGGGCAGTAC GCCAGGGGTC TGCGCGGCGC GGGCTTCGCG GGCCGCATCG 
TCTCGTTCGA GCCGCTGCCC GGGCCCTTTG CCGTCTTGCA GCGCAGCGCC TCCACGGACC 
CGTTGTGGGA ATGCCGGCGC TGTGCGCTGG GCGATGTCGA TGGAACCATC TCGATCAACG 
TCGCCGGCAA CGAGGGCGCC AGCAGTTCCG TCTTGCCGAT GTTGAAACGA CATCAGGACG 
CCTTTCCACC AGCCAACTAC GTGGGCGCCC AACGGGTGCC GATACATCGA CTCGATTCCG 
TGGCTGCAGA CGTTCTGCGG CCCAACGATA TTGCGTTCTT GAAGATCGAC GTTCAAGGAT 
TCGAGAAGCA GGTGATCGCG GGTGGCGATT CAACGGTGCA CGACCGATGC GTCGGCATGC 
AGCTCGAGCT GTCTTTCCAG CCGTTGTACG AGGGTGGCAT GCTCATCCGC GAGGCGCTCG 
ATCTCGTGGA TTCGTTGGGC TTTACGCTCT CGGGATTGCA ACCCGGTTTC ACCGACCCCC 
GCAACGGTCG AATGCTGCAG GCCGATGGCA TCTTCTTCCG GGGCAGCGAT TGACGCGCCG 
GCGCGTCAAT CTATTTCGAC ATTCGCGTGA AGACGTTTTC CCAGAATCGA CTGTTGTAGG 
CGTAGAACTC CCGGCCGCGT AGGTAGGCAT GTGATATTCG CCTTCCCCCG AACGGGTAGC 
GGCGATGAAG GTCGCCCATG CGGCGCAGAT CACCGAAGAC CGCGCTTGGT TCCCGGTGCG 
AGCCGACGCC CGTGGTGTCG AACTCGCACA GCACACACCG AATCGTGACC GGCTCGCATA 
CCAGCGCGGC CCGCAATATG AATTCCTGGT CGGCGGCGAT CCCGAAATCA AGGTCGTAGC 
CACCGATCTT GGCCACCAGC GATGATCCGA AGAACGATGC TTGATGCGGA ACAACCTGCT 
TGCCGGCCAG GAATTTGCGC AGGCTGAAAG GTATCGGGCC GCGCACCCGA TCGAGCCCGA 
CGAGACGATC CATCCCGAAG CCCCACAATT CGGACACCGG TCCCTTGCCG GATAGCGCCT 
CCACGGCCTG GGCTACCACG TCGGGCCCGG AAAAACGATC GGCGGAGTGC AAGAACCACA 
ACAGATCACC CGATGCGTGC GCGATGCCCT GGTTCATCGC GTCGTACCGC CCGCCGTCGG 
GCTCGGACTG CCAATACGCG AAGCCTGGTT CACACCCGGA CAGGTATGCC ACCACGTCGT 
CGCCGCTGCC ACCGTCGATT ACGATGTGCT CGATGCGTGC CCGGTAGCGT TGCGCCCGCA 
CACTTTTCAC CGTGCGCTGC AACCCGTCGA GGTCGTTGAA CGAGATCGTT ATCACCGAGA 
CGGTCGGAGC AGACGTCACC GAGTTCCCCT AGGTTGCTGG CGGCGATTGT GGATCACCGG 
GTCTTGATAC CGATGAAGGT GCCTCGAAGA TTCGCCGCAT AGGAACCTCC GAGCAACGAC 
TCGGCGATGC TTGGTTCCAA GTTGTCGTAC TCCTCCATCA CCAGGTCGAC GCCGACGTCT 
TTGATGGCCT GAAGTAGGTG CTCGCGTTGA ATCCAGAATG ACCGGCGATT GTCCCAGGAC 
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GCCCATTTTG CGGTGTCGCG CTGGCCAAAC 
CTACCGGGAA GTCCCTCATG TTCGGTGGGC 
CGCCGCAGC7V ACCGATCGGT CAATTGTCGT 
CTCCGGTTGA TGATCGAGAA GTGCGTCTGG 
AGGGTTTCCA GGTATTGCTT CGGATTCTCC 

I GTATCX3AAGA GCCCGTGGTT GGCGATGTTG 

I 

GGCAGGTTGG TCTTCGATTT GATGTAGTTG 

] ATCCCGAGGA CCTGAAATCC CATGCGCGCG 

I 

CCGACATCGG CCAGGCGTAG GTGGCTCTTG 

I 

I CGCGCCGAGA TGAACCAGGA CGATTCGTCT 

I 
I 

GTTCCGTCGT CGAGGCGAAC GTTGTGGGCG 
TGTGCCATCA CTTGGTTAGC CCCTTCGGCT 
GACAGCCGCG TCGGAGCCGG GAGGGCCACA 
CGGCGGCGAG CTTGCGCCTIC TGGGTCTTGA 
GCAGCGCCAG CATGGCGATC CGGGGATGGC 
CCTCCGGGCC TGGAACGTTG GCGATCTGGC 
GCTCCTCGCG CGCGGGGTCT CCCGGGAACA 
GCCCCGGACC CTGCGTCAAC CAGAACTTTG 
CTTGGGCGTG GCGGCGATAC ACGGCCATGG 
CCGCX3TGCCG GACGTGCAGA TACCAGTCCA 
AGCGCTCGAG GCGACGGTAC ACGGCCGAGT 
CATCCAGGCT CAAGTTGCCC CGCACCCGAA 
CGTCCTCCCA TATCACTCX3G ACGGGATGGA 
GGAATGCGAC CTGTTTGCTT AGCTTCAGCG 
ACGCGACGTA CTCGCCGCGA GCGGCCGACA 
GGTTTTCGGT CCTGAAGATC GGCCGGAACA 
TGATCGCCGG GGTGGCATCG GTCGACX5CGT 
CGGTTTGCTG GTCGAGAAAG CTGTCGAAGG 
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GAGCGGTCGT CGGAAAACTC GGTAAACCAC 10860 

GCCGAGAGCA TGAACTTCAC CGGCGCCGGC 10920 

GCCGTCGTGG GCAACCGGAG CCATTTATCG 10980 

AGAATCAGCA GCTTGTTCGT TACCGACGAG 11040 

AGGTGGTAGA AGAGGCCGCA GCAGAAGACG 11100 

AGGGCGTTGT CGTGGACGAA CCGGAGATTC 11160 

CAGGCCGCCA TGTTCAGCTC GCGAACCTCG 11220 

AACCCGACCG CGTACCCGCC TTCCAAGCAG 11280 

TCCCCGGGAA AGACGGTTTC CAGAATCCCG 11340 

AACGTGCGCG AGGACTCCGG TATCGTCAAG 11400 

GTGAATTGTA CGGCGCCGGC CGAATGTTCC 11460 

GGTCCTGGGT TTGTCGACAT GGTCAGGCTC 11520 

CATCCACGAG CCCCCTGCGG CTCGGCGTCG 1158 0 

GCCGCCGCGC GGGTGTCGCC CCGCGGTGCT 11640 

GCGCGATGGT TTCCTGCAGC GCGGCGCGCC 11700 

GT^GGATCCA GTCGGCCATG ACGGCGATGA 11760 

GGTCGAGCAT CGCGTCAAAC GTCGCCGCAT 11820 

GCGGGTCCAC CACCTGGTTG TGCCACATGC 11880 

TGTCGGGCAA CATGGCGATG TCGCCATGCA 11940 

GGGGCATGAC GTCGGCAGGA ATGTCGTCGT 12000 

TGGTCTGGAT GAAGTTCATC AAGATCAACG 12060 

CCGGGGGGAA CTTCGAGTCC TTGGCATGGC 12120 

AGCACACCGT CGTCTTGGGG TGCCGGTCGA 12180 

GATCGATCCA GTAGTCGTCC GCCTCGCACA 12240 

GGGCGCCGGT CAGGTTCCCA TTGAGGCCGA 12300 

CGTGCGGGTA CCGCTCGGCG TACTCACGGA 12 360 

CGTCGGCGAC GATGATCTCC ACCGGGAAGT 12420 

CCTGACGGGC GTAGCCCGCC TGGTTGTGAG 12480 
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TGGTCGAGAC GATGCTCACC TTGGGGCAAA GCTGGGGACT CACCGTCGGC CCTTTTCCTG 12540 

CGCGGCCGCA AGGGTATTGC GATGGCGAAC GTGAATCGCC TGTGCCCGCC GGCCGTCGGC 12600 

CGTCGTGGCC TGGTGGTCGG CGGACGTACG GCACACGCTG GCGAAGTATA GCGAGGGTGC 12660 

ACTGACGTTG GGCTCGAACC GCGTGGCGCG CGGTGTGGGC GCACCGTCTC GAGTCGGTGC 12720 

TGGTTGGCTC GC 12732 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 89 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
ATACTCAAGC TTGCCGCT^T CGAAACCAAC CTGTTTGTGC CGCAAGAAAT TACGCCGTGG 60 
CCCGGCGCCG ATCAAGAAAC GCCCCGGCGC GCGGCGGTGT CGTCGTATGG CATGACGGGC 120 
ACCAATGTGC ACGCCATTGT CGAGCAGGCA CCGGTGCCAG CCCCCGAATC CGGTGCACCA 180 
GGCGACACCC CGGCCACACC CGGTATCGAC GGCGCGCTGC TGTTCGCGCT GTCGGCCAGC 240 
TCGCAGGACG CGCTGCGGCA AACCGCCGCG CGGCTGGCCG ATTGGGTCT 289 



(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 
TTGGCGGGTT GGCCACACAC CCGCCGGTGA CGGCGACGAT GCTGGGCTGG TTGCGGCCCT 60 
GCGCCACCGC GGCTTGCATG CTGGTTGGCT GTCTTGGGAC GATCCCG7U\A TAGTCCACGC 120 
GGATCTGGTG ATTTTGCGGG CTACCCGCGA TTACCCCGCG CGGCTCGACG AGTTTTTGGC 180 
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CTGGACTACC CGCGTGGCCA ATCTGCTGAA CTCGCGGCCG GTGGTGGCCT GGAATGTCCA 240 

CGCCGTTCAC CTACGTGACC TTGATGGGAT CCGGGGGT 278 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1280 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CCGACCCAGA CACTGACCGG GCGACCGCTG ATCGGCAACG GCACCCCCGG GGCGGTCGGC 60 

AGCGGGGCCA CCGGGGCCCC CGGTGGGTGG CTGCTCGGCG ACGGCGGGGC CGGCGGGTCC 120 

GGCGCGGCGG GCTCGGGCGC GCCCGGCGGG GCGGGCGGGG CTGCCGGGCT GTGGGGTACC 180 

GGCGGGGCCG GCGGGATCGG CGGAGCCAGC ACCGTACTCG GCGGCACCGG CGGGGGAGGC 24 0 

GGGGTCGGTG GGCTGTGGGG CGCCGGTGGG GCCGGCGGGG CCGGTGGAAC CGGCCTTGTT 3 00 

GGTGGCGACG GCGGGGCCGG TGGGGCCGGC GGGACCGGCG GACTGCTGGC CGGGCTGATC 360 

GGTGCCGGCG GAGGTCACGG CGGGACCGGC GGGCTCAGCA CTAATGGCGA CGGCGGGGTT 420 

GGCGGGGCCG GCGGGAATGC CGGAATGCTC GCCGGGCCGG GCGGCGCCGG CGGAGCCGGC 480 

GGTGACGGCG AAAACCTGGA CACCGGTGGG GACGGCGGGG CCGGCGGTAG CGCAGGGCTG 54 0 

CTGTTCGGCA GCGGCGGCGC CGGCGGCGCC GGCGGATTTG GTTTCCTCGG TGGGGACGGC 600 

GGGGCCGGTG GCAACGCCGG GCTGCTGTTG TCCAGCGGCG GGGCCGGCGG GTTCGGCGGG 660 

TTCGGCACCG CCGGTGGGGT CGGTGGGGCC GGCGGCAATG CCGGCTGGCT GGGCTTCGGC 72 0 

GGGGCCGGGG GCATCGGCGG AATCGGCGGT AACGCTAACG GGGGCGCCGG TGGGAACGGC 78 0 

GGCACCGGCG GTCAGTTATG GGGTAGCGGC GGCGCCGGCG TCGAAGGCGG CGCAGCCTTA 84 0 

AGCGTCGGCG ACACCG6CGG GGCCGGTGGC GTCGGCGGCA GCGCCGGGCT GATCGGCACC 900 

GGCGGCAACG GCGGCAACGG CGGCACCGGC GCCAACGCCG GCAGCCCCGG AACCGGCGGC 960 

GCCGGCGGGT TGCTGCTGGG CCAAAACGGG CTCAACGGGT TGCCGTAGCC GGGCGGCACG 1020 

GCATGGCTTC CGGGCGTCAA CCACTCGCCG GTGATGCAGA TCGGCTGCGG AGCGGGCCGC 1080 
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CAAAATGGGG GCCGCCGCGC CAGGTATCTC GGCGAAGATC CCCGGCGCTC GAGCGCTTTG 1140 



TCAGAGGCCC GTCGCGGGTC GTCGTGACGA CGGCTATCCG GGCGGTGCGG GTTTCGCGGC 1200 



GCGCCCTGTG CCCGGCACCG CCGCCCGTTT GTCGGCAACG CCGCCGCGAC CCGTGAGCCG 1260 
TCCAGCAGCT GGCGCCTGCG 1280 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



GGGCATCGGC GGAATCGGCG GTAACGCTAA CGGGGGCGCC GGTGGGAACG GCGGCACCGG 60 
CGGTCAGTTA TGGGGTAGCG GCGGCGCCGG CGTCGAAGGC GGCGCAGCCT TAAGCGTCGG 12 0 



CGACACC 



127 



